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. ■ ^ : I .. ^n TT S. p-rnvisional 

The present application cla.r»s P^"" ' 31 2000. 

cn/nq 110 which was filed January 
Application Number 60/179,110 wh 6,080,516 
The present application incorporates "-^-^^^^"^ 
and U S Applications Ser. Nos. 08/726,867, °8/728,963, 
0 30; 598 08/942,806, 60/109,302, and 09/276,533 and the r 
reirect'-i^olosures herein b. reference in their ent.ret.. 

1 0 FIELD OF THE INVENTION 

■ ^r. the field of molecular genetics. 
The present invention is m the tieia 

ine pj-co^ are aeneratea 

The application discloses .-^^ ^ /r . 

- p^ocess ----^ : ::r:hroLso.e. .he 

r;rra:i;=^^ r-^nant .ctor allows for the rapid 
ification of the ,ene that has been .utated us.n 

or a„ino acrd se.uenee . f r .ron .^rde.if.^t^^^ 

has been mutated by the vecuuj.. 
gene that has ^^^^ p^„^„^^ 

stem cell clones are produced, such described 
mutant animals capable of germline transmission 
mutated genes. 

2 0 BACKGROUND OF THE INVENTION 
, ..St mam^allan genes are J^rr I. and 

..ons are the portions of the l^H^^^- ,,,, ,,,30 

encode the protein product of a gene. y 

in, exons are often divided b. noncodin, intron sequences, 
.though polymerase transcribes both intron an e 

. the intron sequences must be removed from the 
° ; o the resulting m... can be -anslated into 

"ein Accordingly, all mammalian, and most eukaryotic, cells 

::r:::::rrrr!.r.. ...... 
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strong splice acceptor sequences and are not preceded by a 

^ ^ „^^^^v-c: Hntp.arate into a gene, the 

— ^^^^ 

Tene trap events are su.se,uentXy identified by seiectrn, .or 

no ^v.;,^ ran survive selective culture. 
="^r trapping .as generaiiy pro.en to .e an e".-- 
o. mutating large numbers o. genes. T.e insertron o tH ene 
trap vector creates a mutation in the trapped gene and ai so 
P.O ides a moiecuiar tag for ease o. identifying ; 
Ls been trapped. When ROS.Pgeo was used to trap ge es rt 

e.onstrated that at least SO, of the resulting 
.esulted in a phenotype when examined in .ice 
that the gene trap insertion vectors are useful .ut gens 
Although a powerful tool for mutating genes, the po entral 
m hod has historically been limited by the difficulty .n 

tifying the trapped genes. Methods that have been use to 
identify trap events rely on the fusion transcripts resultrng 

rr<=nomic DNA surrounding tne siLe ^ 

he rlthods have proven labor intensive, not readily amenable 
to automation, and generally impractical for high-throughput. 

More recently, vectors have been developed that rely on a 
new strategy of gene trapping that uses a ^^^^ ^^^^^^^ ; 

rs:^":rn: pridrselectL umess they rntegrate rnto 
agene and subsequently trap downstream axons which provrde a 
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polvadenylationl<,uence. Integration of vectors into the 

results in the splicing of the selectable Marker gene 
■qp vectors provide a number 



chromosome 



to 3' exons of the Liapped gone . 

of advantages. They can be used to trap genes regardless of 
5 Whether the genes are normally expressed in the cell type rn 

Which the vector has integrated, m addition, ---^^^^ 
such vectors can be screened using automated (e.g., 96-well plate 
format, gene identification assays such as 3' RACE (see 
generally, Frohman, 1994, PGR Methods and Applications, 4.S40- 
10 S58, . ■ using these vectors it is possible to produce large 
numbers of mutations and rapidly identify the mutated, or 
trapped, gene by DNA sequence analysis. 

3 0 SUMMARY OF THE INVENTION 
The subject invention provides numerous isolated ma^alian 
mutant cell clones that are each characterized by the insertron 
of a mutagenic genetically engineered polynucleotide seguenc 
into a gene identifiable as corresponding to one or more of 
OMNIBANK gene trapped sequences (GTSs) disclosed in Sequence 

" ""'rhe subject invention further contemplates a mutated cell, 
and particularly a mutated ES cell, and the animals derived from 
such ES cell that stably maintain a genetically engineered 
.utation in a gene identifiable as corresponding to one of 

25 disclosed GTSs. 

4 0 DESCRIPTION OF THE SEQUENCE LISTING AND FIGURES 
The sequence Listing is a compilation of nucleotide 
sequences obtained by sequencing clonal lines of gene trapped 

30 murine ES cells. 

Figures lA-lC present a diagrammatic representation 
representative gene trap vectors used to generate the described 
sequences . 
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I . ^ , ^ seaSRe Listing and the 

Fiaure 2 prides an index to the Sequ^uc 
eo^^elrondin, database accession nu..e.s .o. the .enes that have 
been mutated accor ding Lu the pre ^^nt invention. 

5 0 DETAILED DESCRIPTION OF THE INVENTION 

^■^r. relates to novel mutated mammalian 
The current invention relates to 

u ^K=.r-^rterized bv the insertion ot a 
cells that are each characterizea uy 

n^binant (i e , genetically engineered) mutagenic 
recombinant (i.e., g -H^ntifiable as corresponding 

polynucleotide sequence into a gene xdentifiable 
to one of the GTSs of SEQ ID NOS: 1-341. 

For the purposes of the present invention, the term 
ifiable" is to be construed as indicating that a mammalian 
identifiable -.pEScell has been mutated by the 

cell, and preferably, a murine ES cell, n 

Lai i. at least partially described in one of the GTSs o£ 

,uen.e Listln,. One .et.od o. 
ae crimed mutated .a^alian cells has a ^utatron xn a , ne 
, interest is by comparing the polynucleotide sequence (or a 
D interest j r- identifying the 

corresponding a^ino acid sequence, of the GTS rdent y 
^1 „ to the full length sequence of the gene. 

TBLASTX using the default parameters used, for 

:::-rr r;r.::';/r». 
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„u.ation in a ,1 i.enti.iea .V G.NB^HK oft.ES.Q .fo. example, 
accession number in Fig ure 2^ ^^^^ ^^^^ 

AS used herein, Lhe ucrmc mutarf^u ^ ^ 

genetic Xocus has been alte.e. b. a process invoiv.n, t.e 
,„.e raUon or incorporation of a ,ene.icaiiy en.rneer 
polynuoieotide sequence into the genome of the celi wxth the 
Tesuit that the subsequent Xeveis of activity of the product 
normally encoded by the locus is altered (i.e., reduced, 
increased or substantially ablated, . In those instances where 

he Tutatlon substantially completely disrupts the e.pressron or 
activity Of the product normally encoded by the locus re. 
null mutation,, a cell that is heterozygous for the .utated 
allele will typically produce about one half of the product of a 
n : tated cell (via a ,ene dosa.e effect,, and about twrce 
amount of product produced by a cell that is homozygous for 

""^re^tlrr-recombinantly manipulated" shall mean that such 

:r:::: — ^^^^^^ 

i„ vitro or e. vivo (see generally, Sambroo. et ai., 1989, 
Molecular Cloning, A Laboratory Manual, Cold Springs Harbor 
press N.Y.; and Ausubel et al., 1989, Current Protocols rn 
Molecular Biology, Green Publishing Associates and Wiley 

Interscience, N.Y.). o i e 

Where, the specifically exemplified mammalxan cells, x.e., 
lie (Tex-1 cells from murine strain A129) , are 
embryonic stem cells (Lex 1 

.stated by a process involving the insertion of at least 

or:::: of a ^neticauy engineered vector seguence into the gene 
of interest, the mutated embryonic stem cells can be 
30 1 injected into blastocysts which are subseguently rntroduced 
into ps udopregnant female hosts and carried to term usrn 

stablished methods such as those described in, for example, 
: ouse mutagenesis", 1993, Zambrowicz et ai., eds . , Lexrco 
press. The Woodlands, TX, and periodic updates thereof, herern 
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^1. o The resulting Wmeric animals are 

:::::::::: :?::p^-»^,.^r-:::r::::L. 

transmission ot an allele containiu.j 

the gene of interest. animals 
alternative method of producrng mutated cells 

i^fiPd aenes involves the process of 

those exempl^Lfied m U.S. Rpp entirety. Mutations 

is herein incorporated by reference « its entire y 

-r: ::;::r rr.":::::;r .:::r: r:r:.i 

knockout mutations, ^ ^ r-^n be 

is used to replace its murine orthologs) , can 

.... --rr.:;rr:rr„.e 
rr;"jr;rr:".r......... 

, that involve the substantial removal of vector 
.,a processes th at invo ^^^^^^^^^ ^^^^^^^^^^ 

encoded sequences (often sequences into the 

incorporation of the reco^binantly manipulated sequen 

genome . 

5 1 MUT.TED MAMMAIIAN CELLS OP THE PRESENT IWE.TION 
.hi presently described mutated cells have genetically 

Zb m LUC M . . -,y-p cells comprising 

.er +-v,^ -rvr-P^t^pnt invention are cexxo ^ ^ 

—.:.::r:: ......... 

« •< »• °' 

means of identityi g Typically, such genes will be 

reported in the sequence listing. Typically, 
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identifiable befte a disclosed GTS, or poWion thereof, shall 
hybridize to the gene under stringent conditions. 

By way of e . a mplo and n o t li m i tation, high stringency 

hybridization conditions can be defined as follows: 
Prehybridization of filters containing DNA to be screened rs 
carried out for 8 h to overnight at 65"C in a buffer containing 6X 
SSC, 50.M Trls-HCl ,PH 7.5), lr»^. EDTA, 0.02% PVP, 0.02% Ficoll, 
0 02% BSA, and 500 ,^g/ml denatured salmon sperm DNA. Filters are 
hybridized for 48 h at 65»C in prehybridization mixture contarnrng 
lOOw/ml denatured salmon sperm DNA and 5-20 x 10« cpm of "P- 
labeled probe (alternatively, as in all hybridizations described 
herein, approximately 42, 44, 46, 48, 50, 52, 54, 56, 58, 62, 64, 
66 68 70, or about 72 degrees or more can be used) . The 
filters are then washed in approximately IX wash mix (lOX wash 
:„lx contains 3M NaCl, 0 . 6M Tris base, and 0.02M EDTA, 
alternatively, as with all washes described herein, 2X, 3X, 4X 
5X 6X wash mix, or more, can be used, twice for 5 minutes each ^ 
at room temperature, then in IX wash mix containing 1% SDS at 60 C 
(alternatively, as in all washes described herein, approximately 
42 44, 46, 48, 50, 52. 54, 56, 58, 62. 64, 66, 68, 70, or about 
72 degrees or more can be used, for about 30 mln, and finally rh 
0 3X wash mix (alternatively, as in all final washes described 
herein, approximately, 0.2X, 0.4X, 0.6X, O.SX, IX, or any 
concentration between about 2X and about 6X can be used in 
5 conjunction with a suitable wash temperature, containing 0.1, SOS 
at 60°C (alternatively, approximately 42. 44, 46, 48, 50, 52, 54, 
56 58 62, 64, 66, 68. 70, or about 72 degrees or more can be 
usld, for about 30 mln. The filters are then air dried and 
exposed to x-ray film for autoradiography. In an alternative 
0 protocol, washing of filters is done for 37°C for 1 h in a 

solution containing 2X SSC, 0.01% PVP, 0.01% Ficoll, and 0.01, 
BSA This is followed by a wash in O.IX SSC at 50-C for 45 mm 
before autoradiography. Another example of hybridization under 
highly stringent conditions is hybridization to filter-bound DNA 
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in 0.5 M sodium dodecyl sulfate Ws) , I rm EDTA at 

65°C, and washing in O.lxSSC/0.1% SDS at 68°C (Ausubel F.M. et 

al.r eds.. 1909, Current Protoc nl s in Molecular Biology, Vol. I, 

Greln Publishing Associates, Inc., and John Wiley & sons. Inc., 
New York, at p. 2.10.3). 

Alternatively, moderately stringent conditions can be used 
(e.g., washing in 0.2xSSC/0.1% SDS at 42° C (Ausubel et al., 1989, 
supra) . Moderately stringent conditions can be additionally 
defined, for example, as follows: Filters containing DNA are 
pretreated for 6 h at 55°C in a solution containing 6X BSC, 5X 
Denhart's solution, 0.5% SDS and 100 pg/ml denatured salmon sperm 
DNA. Hybridizations are carried out in the same solution and 5- 
20 X 10^ cpm 32p_iabeled probe is used. Filters are incubated rn 
hybridization mixture for 18-20 h at 55°C (alternatively, as in 
all hybridizations described herein, approximately 42, 44, 46, 
48 50 52, 54, 56, 58, 62, 64, 66, 68, 70, or about 72 degrees 
or' more can be used in combination with a suitable concentration 
of salt) . The filters are then washed in approximately IX wash 
mix (lOX wash mix contains 3M NaCl, 0 . 6M Tris base, and 0.02M 
EDTA, alternatively, as with all washes described herein, 2X, 3X, 
4X, 5X, 6X wash mix, or more, can be used) twice for 5 minutes 
each at room temperature, then in IX wash mix containing 1% SDS 
at 60°C (alternatively, as in all washes described herein, 
approximately, 42, 44, 46, 48, 50, 52, 54, 56, 58, 62, 64, 66, 
68, 70, or about 72 degrees or more can be used) for about 30 
min, and finally in 0.3X wash mix (alternatively, as in all final 
washes described herein approximately 0.2X, 0.4X, 0.6X, 0.8X, IX, 
or any concentration between about 2X and about 6X can be used xn 
conjunction with a suitable wash temperature) containing 0.1% SDS 
at 60°C (alternatively, approximately 42, 44, 45, 48, 50, 52, 54, 
56, 58, 62, 64, 66, 68, 70, or about 72 degrees or more can be 
usld) for about 30 min. The filters are then air dried and 
exposed to x-ray film for autoradiography. 
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In an alteWtive protocol, washing o^ilters is done twice 
for 30 minutes at 60°C in a solution containing IX SSC and 0.1% 
?nq v^^tf^.rH are blotted dry and exposed for autoradiography. 

Other conditions of moderate stringency which may be used 
are well-known in the art. For example, washing of filters can 
be done at 37°C for 1 h in a solution containing 2X SSC, 0.1% SDS. 
Another example of hybridization under moderately stringent 
conditions is washing in 0.2xSSC/0.1% SDS at 42°C (Ausubel et al., 
1989, supra). Such less stringent conditions may also be, for 
example, low stringency hybridization conditions. By way of 
example and not limitation, procedures using such conditions of 
low stringency are as follows (see also Shilo and Weinberg, 1981, 
Proc. Natl. Acad. Sci. USA 78:6789-6792): Filters containing DNA 
are pretreated for 6 h at 40°C in a solution containing 35% 
' formamide, 5X SSC, 50mM Tris-HCl (pH 7.5), 5mM EDTA, 0.1% PVP, 
0.1% Ficoll, 1% BSA, and 500 /ug/ml denatured salmon sperm DNA. 
Hybridizations are carried out in the same solution with the 
following modifications: 0.02% PVP, 0.02% Ficoll, 0.2% BSA, 
lOO^g/ml salmon sperm DNA, 10% (wt/vol) dextran sulfate, and 5-20 
X 10^ cpm 32p_iabeled probe is used. Filters are incubated in 
hybridization mixture for 18-20 h at 40°C (alternatively, as in 
all hybridizations described herein, approximately 42, 44, 46, 
48, 50, 52, 54, 56, 58, 62, 64,. 66, 68, 70, or about 72 degrees 
or more can be used) . The filters are then washed in 
approximately IX wash mix (lOx wash mix contains 3M NaCl, 0 . 6M 
Tris base, and 0.02M EDTA, alternatively, as with all washes 
described herein, 2X, 3X, 4X, 5X, 6X wash mix, or more, can be 
used) twice for five minutes each at room temperature, then in IX 
wash mix containing 1% SDS at 60°C (alternatively, as in all 
washes described herein, approximately 42, 44, 46, 48, 50, 52, 
54, 56, 58, 62, 64, 66, 68, 70, or about 72 degrees or more can 
be used) for about 30 min, and finally in 0 . 3X wash mix 
(alternatively, as in all final washes described herein, 
approximately, 0.2X, 0.4X, 0.6X, 0.8X, IX, or any concentration 
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between about 2=4nd about 6X can be used Conjunction w.th a 
suitable wash temperature, containing 0.1% SDS at 60 C 
.alternatively, a p prox i m ately 44 , .6, 48, 50, 52, 54, 5 , 

62 64 66, 68, 70, or about 72 degrees or more can be used) for 
5 ablut 30 mm. The filters are then air dried and exposed to x- 
„y film for autoradiography. In yet another alternative 
protocol, washing of filters is done for 1.5 h at 55»C in a 

. ■ ■ . ,x SSC 25mM Trls-HCl (pH 7.4), 5mM EDTA, and 
solution containing 2X SSC, /smii in t- 

0 1% SDS. The wash solution is replaced with fresh solution and 
1 1 t; h St 60°C Filters are then blotted 
10 incubated an additional 1.5 h at 60 C. 

dry and exposed for autoradiography. If necessary, ^ 
washed for a third time at 65-68"C and reexposed to fi m Other 
conditions of low stringency which may be used are well .nown in 
the art (e.g., as employed for cross-species hybridizations,. 
15 preferably, GTS variants identified or isolated using the above 
methods will also encode a functionally equivalent gene product 
(i e , protein, polypeptide, or domain thereof, encoding or 
otherwise associated with a function or structure at least 
partially encoded by the complementary GTS, . 

LOW stringency conditions are well .nown to those of s.ill 
in the art, and will vary predictably depending on the specific 
organisms from which the library and the labeled sequences are 
derived. Tor guidance regarding such conditions see, for 

V =)- fli 1989 Molecular Cloning, A Laboratory 
example, Sambroolc et al., 19Ma, 
25 Manual, Cold Springs Harbor Press, N.Y.; and Ausubel et al., 

1989, current Protocols in Molecular Biology, Green Publishing 
Associates and Wiley Interscience, N.Y. 

The identification of homologs. heterologs, or paralogs of 
SEQ ID HOS: 1-341 in other, preferably related, species can be 
30 useful for developing additional animal model systems that are 

Closely related to humans for purposes of drug discovery. Genes 
at other genetic loci within the genome that encode proteins 
Which have extensive homology to one or more domains °f ^^e gene 
products encoded by SEQ ID «0S: 1-341 can also be identified via 
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0 



25 



30 



similar techniq^. In the case of cDNA liHries, such 
screening techniques can identify clones derived from 

alreraaU.ely sp lic e d t r. n s rripts in the same or different 

species . •it 

Techniques useful to disrupt a gene in a cell and especrally 
an ES cell that may already have a disrupted gene are disclosed 
in copending US patent applications Nos. 08/726.867; 08/728,963, 
08/907,598; and 08/942,806, all of which are hereby incorporated 
herein by reference in their entirety, are within the scope of 
the current invention to disrupt a gene that encodes a 
polynucleotide of the current invention. 

5.2. USES OF THE DESCRIBED MUTATED GENES AND ANIMALS 
The described mutated cells and animals are used to 
investigate and define the cellular and biological functions of 
the mutated genes. Producing a scientific model that accurately 
accounts for the large number of genes, proteins, and 
macromolecules within a single cell has thus far proved beyond 
the capabilities of existing computer technology. It should thus 
not be surprising that the far more complex tas. of modeling the 
various intricacies, cross and direct redundancies, and 
interrelated functions of the various metabolic and catabolrc 
processes that occur within a single cell has also proven largely 
intractable to algorithmic methods of modeling and predictron. 
Even if one assumes that computer modeling of Inherently 
chaotic/heuristic processes will rapidly mature in the near 
future, such methods, at best, can only provide predictions that 
subsequently require practical validation. Several decades of 
empirical data have proven that mutant phenotypes provide a 
valuable source of such validation. 

The mutated diploid mammalian cells of the present invention 
will initially exist as mutated diploid cells that are 
heterozygous (except where genes on the X or Y chromosomes are 
mutated) for the mutations identified in the sequence listing. 
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fis such, via a ifce dosage" effect, the mufted cells can 
typically be characterized by the fact that they produce about 

one half of the mutated uiau..iipt/activity relative to cells 

having two nonmutated or wild type copies of the corresponding 
gene . 

When mutant animals are produced from the mutated cells, 
heterozygous animals capable of germline transmission of the 
mutated allele can be bred to produce embryos or offspring that 
are homozygous for the mutant allele. Such animals or embryos 
are a rich source of tissues and cells that do not express 
physiologically relevant amounts of the mutated genes or 
activities encoded thereby. Accordingly, an additional 
embodiment of the present invention are mutant cells and animals 
that have homozygous mutations in genes identifiable as 
corresponding to the GENBANK, or other database accession, 
numbers provided in Figure 2, or are identifiable as a homologs, 
paralog, or orthologs of a sequence provided in the Sequence 
Listing. 

in addition to providing important information regarding the 
functional role of a given gene in its nonmutated state (i.e., 
you learn about the function of the gene by discerning the 
effects of reducing or ablating the activity normally encoded by 
the gene), the described mutated cells and animals can be used as 
disease models, or in assays for compounds or genes (via gene 
delivery or transgenic methods) that compensate for the mutant 
phenotype and that can be used to treat diseases and disorders 
related to the observed phenotype. Alternatively, such products 
and genes can also be used to enhance desirable, if not normal, 
symptoms related to the observed phenotypes. 

The gene replacement /delivery therapies described above 
should be capable of delivering gene sequences to the cell types 
within patients which express the peptide or protein having the 
desired activity. 
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The examplPbelow are provided to illWtrate the subject 
invention. These examples are provided by way of illustration 

n o t included thf:. purpose of limiting the invention in 



and 



any way whatsoever. 



EXAMPLES 



6 1 GENERATION OF A LIBRARY OF MUTATED MOUSE ES CELLS DEFINED 
BY GTS SEQUENCES 

The retroviral vector VICTR 3, described in detail in U.S. 
application Ser. No. 08/728,963, filed October 11, 1996, was used 
to generate a library of gene trapped ES cell clones that 
represent a portion of the described GTSs. A plasmid containing 
the VICTR 3 cassette was constructed by conventional cloning 
techniques and designed to employ the features described above. 
Namely, the cassette contained a PGK promoter directing 
transcription of an exon that encodes the puro marker and ends in 
a canonical splice donor sequence. At the end of the puromycin 
exon, sequences were added as described that allow for the 
annealing of two nested PGR and sequencing primers. The vector 
backbone was based on pBluescript KS+ from Stratagene 
Corporation. 

The plasmid construct was linearized by digestion with Sea I 
which cuts at a unique site in the plasmid backbone. The plasmid 
was then transfected into the mouse ES cell line AB2 . 2 by 
electroporation using a BioRad Genepulser apparatus. After the 
cells were allowed to recover, gene trap clones were selected by 
adding puromycin to the medium at a final concentration of 3 
/.g/ml. Positive clones were allowed to grow under selection for 
approximately 10 days before being removed and cultured 
separately for storage and to determine the sequence of the 
disrupted gene. 

Total RNA was isolated from an aliquot of cells from each of 
18 gene trap clones chosen for study. Five micrograms of this 
RNA was used in a first strand cDNA synthesis reaction using the 
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' "RS" primer. tS primer has unique sequerft ( for subsequent 
PGR) on its 5- end and nine random nucleotides or nine T 

(Lhvmidlne) rco idu o c on i t ■ s V end. Reaction products from the 

first strand synthesis were added directly to a PGR with outer 
5 primers specific for the engineered sequences of puromycin and 
the "RS" primer. After amplification, an aliquot of reaction 
products were subject to a second round of amplification using 
primers internal, or nested, relative to the first set of PGR 
primers. This second amplification provided more reaction 
10 product for sequencing and also provided increased specificity 
for the specifically gene trapped DNA. 

The products of the nested PGR were visualized by agarose 
gel electrophoresis, and seventeen of the eighteen clones 
provided at least one band that was visible on the gel with 
15 ethidium bromide staining. Most gave only a single band which is 
an advantage in that a single band is generally easier to 
sequence. The PGR products were sequenced directly after excess 
PGR primers and nucleotides were removed by filtration in a spin 
column (Gentricon-100, Amicon) . DNA was added directly to dye 
20 terminator sequencing reactions (purchased from ABI) using the 

standard M13 forward primer a region for which was built into the 
end of the puro exon in all of the PGR fragments. 

subsequent studies have used both VIGTR 3 and VICTR 20. 
Like VICTR 3, VIGTR 20 is exemplary of a family of vectors that 
25 incorporate two main functional units: a sequence acquisition 
component having a strong promoter element (phosphoglycerate 
kinase 1) active in ES cells that is fused to the puromycin 
resistance gene (or other exon sequence) that is followed by a 
synthetic consensus splice donor (SD) sequence and lacks an 
30 operatively positioned polyadenylation sequence downstream from 
the SD sequence (PGKpuroSD) ; and 2) a mutagenic component that 
incorporates a splice acceptor sequence fused to a selectable 
and/or colorimetric marker gene and followed by a polyadenylation 
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sequence (for eHple, SAPgeopA, SAneopA, smESneopA, or 
SAIRESPgeopA) . 

Alco like VICTR .stop codons have been engineered into all 



three reading frames in the region between the 3' end of the 
selectable marker and the splice donor site. A diagrammatic 
description of structure and functions of VICTRs 3 and 20 is 
provided in Figure 1. 

When VICTRs 3, 20, and various variations thereof such as 
the vectors and methods described in U.S. Applications Ser. Nos. 
09/276,533, and 60/095,989 (the disclosures of which are herein 
incorporated by reference), were used in the commercial scale 
application of the presently disclosed invention, many 
mutagenized ES cell clones were rapidly engineered and obtained, 
sequence analysis obtained from these clones has identified a 
wide variety of sequences. Each of the sequences presented in 
SEQ ID NOS: 1-341 identify novel mutations in the coding regions 
of mammalian genes that identifiable as corresponding to the 
sequences presented in the Sequence Listing. Alternatively, the 
described mutated cells are described by the database (GENBANK, 
GENSEQ, etc.) accession numbers for the corresponding genes that 
have been mutated (see Figure 2). The described mutated cells, 
and preferably ES cells, provide a valuable resource for 
defining, evaluating, or validating the biological function or 
disease/pharmaceutical relevance of each of these genes. 

The cloned 3' RACE products resulting after the target ES 
cells were infected with one of the described gene trap vectors 
were purified using conventional column chromatography, (e.g., 
S300 and G-50 columns), and the products were recovered by 
centrifugation. Purified PGR products were quantified by 
fluorescence using PicoGreen (Molecular Probes, Inc., Eugene 
Oregon) as per the manufacturer's instructions. 

Dye terminator cycle sequencing reactions with AmpliTaq® FS 
DNA polymerase (Perkin Elmer Applied Biosystems, Foster City, CA) 
were carried out using approximately 7 pmoles of sequencing 
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primer, and app^imately 30-120 ng of 3' template, 
unincorporated dye terminators were removed from the completed 
-s equ e ncing r e actions u si ng r.-.SO columns as described above. The 
reactions were dried under vacuum, resuspended in loading buffer, 
and electrophoresed through a 6% Long Ranger acrylamide gel (FMC 
BioProducts, Rockland, ME) on an ABI Prism® 377 with XL upgrade 
as per the manufacturer's instructions. The sequences of the 
resulting amplicons, or GTSs, are described in SEQ ID NOS: 1-341. 

All publications and patents mentioned in the above 
specification are herein incorporated by reference. Various 
modifications and variations of the described method and system 
of the invention will be apparent to those skilled in the art 
without departing from the scope and spirit of the invention. 
Although the invention has been described in connection with 
specific preferred embodiments, it should be understood that the 
invention as claimed should not be unduly limited to such 
specific embodiments. Indeed, various modifications of the 
above-described modes for carrying out the invention which are 
obvious to those skilled in the field of molecular biology or 
related fields are intended to be within the scope of the 
following claims. 
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